N87-18864 

OPTIMAL CONTROL CONCEPTS IN DESIGN SENSITIVITY ANALYSIS 


Ashok D. Belegundu 
Mechanical Engineering Department 
The Pennsylvania State University 
University Park, PA 


ABSTRACT 

In this paper, a close link is established between open loop optimal 
control theory and optimal design by noting certain similarities in the gradient 
calculations. The resulting benefits include a unified approach, together with 
physical insights in design sensitivity analysis, and an efficient approach for 
simultaneous optimal control and design. Both matrix displacement and matrix 
force methods are considered, and results are presented for dynamic systems, 
structures, and elasticity problems. 


1 . INTRODUCTION 

Considerable interest is being shown in recent years on control of flexible 
systems such as robots and space structures. In control theory and optimal 
control in particular, the geometry (dimensions and shape) is given, and the 
task is to develop a control law so as to ensure proper operation of the system 
in an uncertain environment. In design, and optimal design in particular, the 
task is to determine the geometry. Evidently, at least in the preliminary 
design stages, there is interaction between optimal control and optimal design. 
There is a need for better understanding of this interaction. In this paper, a 
close link is established between these two disciplines. Specifically, the 
similarity of the sensitivity calculations and adjoint equations is examined. 
Practical benefits and new possibilities are discussed. Dynamic systems, 
structures, and continuum elasticity models are considered. Both displacement 
and matrix force methods of structural analysis are treated. 


2. THE LAGRANGE MULTIPLIER RULE FOR CALCULATING SENSITIVITY COEFFICIENTS 

The Lagrange multiplier rule is a well-known method for obtaining 
optimality conditions in the presence of constraints. The rule, however, serves 
equally well in obtaining expressions for sensitivity coefficients (or 
derivatives) of implicit functions, as shown below. 

Consider the scalar valued function f = f (x,b), where x is an (nxl) 
vector of 'state* variables and b is a (kxl) vector of design variables. The 
function f is implicit in that for every vector b, x satisfies the state 
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equation 


( 1 ) 


g (x, b) = 0 

where g is (nxl) vector function. It is desired to obtain the sensitivity 
vector~df /db . In design of structural and mechanical systems, f often 
represents the stress or displacement of a point and Eq. (l) is the equation 
of equilibrium. To illustrate the Lagrange multiplier rule for calculating 
df/db, we first form the scalar valued function H as 

H = f + xTg (2) 

where X is an (nxl) vector of Lagrange multipliers or 'adjoint' variables. 
Noting that f, g and H are functions of^x and__b, we have, upon differentiating 
H with respect to b, 


dH/db = 3H/3b + 3H/3x dx/db (3) 

The idea behind the Lagrange multiplier rule is to require that X satisfy the 
equations 


3H/3x = 0 (4) 

Assuming that 3g/3x is a nonsingular matrix — which is necessary for x to be 
a unique solution to Eq. (1) — and using Eq. (4), we can obtain X from the 
following 'adjoint equations': 

[3g/3x] T X = - 3f/9x T (5) 

Equation (3) now provides the result 

df/db = 3H/db (6) 


or, 


df/db = 3f/3b + X T 3g/3b. (7) 

In Eq. (7), the term g T 3X/3b does not appear because of Eq. (1). 

The fact that the Lagrange multiplier rule offers a unified approach to 
design sensitivity analysis has been discussed in Ref. 1. Further, the adjoint 
method of design sensitivity analysis given in Ref. 2 results in the exact same 
equations as obtained using the Lagrange multiplier rule. In this paper, the 
use of this rule to obtain expressions for sensitivity coefficients helps to 
establish a close link between optimal design and optimal control, as discussed 
in the next section. 
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3. OPTIMAL CONTROL AND OPTIMAL DESIGN 
Optimal Control 

To present the basic concepts, consider a dynamic system described by the 
following nonlinear differential equations 

x = q ( x ( t ) , u(t), t); x(t Q ) given, t Q < t < t f (8) 

where the 'state' x(t), an (nxl) vector function, is determined by the 
'control' u(t), a (kxl) vector function. Consider a performance index given by 
the scalar^f unctional 

ftf 

F = t J f (x(t), u(t), t) dt ( 9 ) 

o 

The optimal control problem is to find u(t) that minimizes (or maximizes) F 
[31. The Lagrange multiplier rule as discussed in the previous section, is used 
to do this. Adjoin the system in Eq. (8) to F with multiplier functions (or 
adjoint variables) A(t): 


F = t J f [f + A T (q - x)] dt (10) 

o 

If we define the scalar function H, the Hamiltonian, as 

H = f + X T q ( 11 ) 

and integrate the last term on the right side of Eq. (10) by parts, we obtain 

F = “ X T (t ) x (t ) + A T (t ) x (t ) 

I'-I - 0^0 

t T 

+ t J f [H + A x] dt (12) 

o 


Now, consider the variation in F due to variation in the control vector 
u(t) for fixed times t 0 and tf and fixed initial conditions, 


5 F = - A x Sx 




T 

OH/9x + A ) Sx dt 


+ 



■P 6u dt 
9u 


( 13 ) 
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Since it is tedious to determine the variations 5x(t) produced by a given 
Su(t), we choose the multiplier functions X(t) to'satisfy 


• T _ _ 3H _ 3f _ .T 
A ' ” 3x " " 3x ” - 3x 


( 14 ) 


with boundary conditions 


X T (t f ) = 0 

In view of the adjoint equations in (14) and (15), Eq. (13) yields 


6F 



3H/3u 6u dt 


(15) 


(16) 


The functions 3H/3u can be interpreted as impulse response functions since 
each component of 3H/3u represents the variation in F due to a unit impulse in 
the corresponding component of u at time t [ 3 ] - Furthermore, 3H/3u can be 
interpreted as the function-space gradient of F with respect to u. This last 
interpretation is useful when using gradient methods to extremize F. For 
example, choosing u(t) = -a 3H/3u corresponds to a steepest descent step to 
minimize F. 

Finally, it should be noted that setting 3H/3u = 0 yields the optimality 
conditions. In the special case when F is quadratic in x and u and Eqs. (8) 
are linear, the optimality conditions together with the state equations (8) and 
adjoint equations (14) and (15) can be solved in closed form, leading to the 
Ricatti equations, which are very attractive in closed loop control since the 
feedback law is independent of the state vector x and can be computed ’off- 
line' . 

Optimal Design 


In optimal design of mechanical systems, it is required to obtain the 
sensitivity vector dF/db where F is a cost or constraint functional of the 
form 


F 



f(x(t), b, t) dt 


(17) 


with b a ( kx 1 ) vector of design variables. For example, F represents a time- 
averaged performance measure of a vehicle traversing a rough terrain. Most 
gradient-based nonlinear programming codes require input of the vector 
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dF/db. In Eq. (17), for a given b, x should satisfy the equations of motion 
given by 


x = g ( x ( t ) , b, t); x(t Q ) given, t Q < t < t f 


(18) 


As before, the use of the Lagrange multiplier rule requires the function 


F 



[f + A T (q - x)] dt 


(19) 


Integrating the last term on the right side of Eq. (19) by parts yields 


F 


- A T ( t ) x ( t _ ) + A T (t ) x(t ) 
~ X ~ I ~ o~o 

f fc f . T 

+ t J [H + A x] dt 

o 


( 20 ) 


where H is defined in Eq. (11). Now, consider the variation in F due to 
variations (or differentials) in b for fixed times t Q and t f , and fixed 
initial conditions: 


5 F 



T 

(9H/3x + A ) Sx dt 


+ 



9H/9b 6b dt 


( 21 ) 


If we choose A(t) to satisfy the same adjoint equations as in the optimal 
control problem in (Hi) and (15), we obtain 


6F 



9H/9b 6b dt 


( 22 ) 


Since b is not a function of time, Eq. (22) yields the sensitivity coeffi- 
cient vector 
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dF/db 



3H/3b dt 


(23) 


Expressions as in Eq. (23) can then be fed into nonlinear programming codes to 
obtain improved design vectors b. The main emphasis here, however, is to 
show that the calculation of design sensitivity vectors is simply a special case 
of open loop optimal control. That is, treating the control variables u(t) as 
design variables enables us to obtain expressions for the sensitivity vectors. 

The following advantages result from this observation: 

(1) A general approach to design sensitivity analysis is 
established . 

(2) Physical significance of the adjoint variables is established. In 
particular, in the above discussion, the functions 3H/9u are 
interpreted to be influence functions. The importance of such a 
physical interpretation in structural design is discussed 
subsequently. 

(3) The fact that the adjoint equations are the same in the optimal 
control and optimal design problems motivates an efficient gradient 
approach for simultaneous handling of control variables and design 
parameters. That is, functionals of the form 


F 


t J f f (x(t) , u(t) , b, t) 

o 


(24) 


where both control variables u(t) and design parameters b have to be opti- 
mally chosen, can be treated efficiently. Such problems may arise, for example, 
when designing both a control law as well as determining the dimensions and 
shape for a robot or for a flexible space structure. 


4. STRUCTURES 

Matrix Displacement Method 


The general results discussed in the preceding section lead to special 
insights when applied to structural systems. Consider a scalar function 

f s f (x, b) (25) 

where f typically represents the stress or displacement at some point in the 
structure, b is a (kxl) vector of design variables, and x is the nodal 


displacement vector. If a finite element model of the structure exists, then 
the (nxl) vector x is obtained from the matrix displacement (finite element) 
equations 


K(b)x = P(b) (26) 

where K is an (nxn) structural stiffness matrix and P is a vector of applied 
nodal loads. 

The importance of applying optimal control concepts to structural systems 
described by (25) and (26) will now be shown. The sensitivity vector 
df/db will be obtained by using the Lagrange multiplier rule. Define the 
function H as 

H = f + X T (P - K x) (27) 

where X is the (nxl) adjoint vector. The variation of H due to a variation in 
b is given by 

6H = 9H/9b <5b + 9H/9x 6x (28) 

Choosing X to satisfy 


9H/9x =0 (29) 

which can also be written as K X = 9f/9x T , we have from Eq. (28), 

Sf = 9H/9b 6b (30) 

which yields 

df/db = 9H/9b (31) 

Now, in the foregoing analysis, let us consider the variation in H due to a 
variation in P. That is, we consider variations in the ’control' vector 
P instead of the design vector b. We have 

6H = 9H/9P 6P + 9H/9x 6x (32) 

Choosing X to satisfy the adjoint equation in (29), and noting that f and K 
do not depend explicitly on P for linear structures, we get 

6f = X T 6P (33) 

or, 

X T = df/dP ( 34 ) 

Since the adjoint equations in (29) are the same regardless of whether the 
fundamental variation is in b or P, Eq. (34) shows that the adjoint vector 
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X used in structural design sensitivity analysis represents the sensitivity 
of the function f to variations in the applied loads p. Further, if f is 
linear in P, then X^ = value of f due to Pj = 1 . In civil engineering, X is the 
influence coefficient vector associated with the function f, as discussed in 
Ref. 4. Further, since Eqs. (29) can be written as 

K X = 9f/9x T ( 35 ) 

we can think of X as a displacement vector associated with the load vector 
9f/9x. This motivates the use of element shape functions to obtain expres- 
sions for X within the elements from knowing the nodal values. Thus, we can 
write 

XU) = l X. N.U) (36) 

i 1 1 

where Xj[ are the nodal values obtained by solving Eq. (35) and are the 
familiar shape functions used in finite element analysis. 

The beam in Fig. 1(a) is solved to illustrate this. A finite element model 
of the beam is shown in Fig. 1(b). The function f is taken to be the moment at 

support b. The adjoint vector X, representing the values of f due to unit 

loads along each degree of freedom, is obtained by solving Eq. (35). Equation 

(36) is used to obtain expressions for X along the beam, which is used to draw 

the influence line as shown in Fig. 1(c). The results are in agreement with 
those in Ref. 5, and show that the adjoint method is a new and powerful approach 
for determining influence lines. 

Some other interesting aspects relating to Eqs. (34) and (35) may now be 
noted. If we let f be the strain energy function U given by 

U = x T K x (37) 

then Eq. (35) yields K X = K x, from which X = x. Equation (34) then yields 

x T = dU/dP (38) 

which is a discrete version of Castiglaiano' s theorem for linear structures. 

1 T T 

Also, letting f = - x Kx-x F = potential energy, results in X = 0 and dir/dP 
= 0, which is a statement of the minimum potential energy theorem^ 

Matrix Force Method 


The systematic use of the Lagrange multiplier rule or adjoint method for 
design sensitivity analysis and physical significance of adjoint variables, 
which was discussed in the context of displacement finite element analysis, will 
now be extended to structures analyzed by the the matrix force method. 
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For indeterminate structures, the equilibrium equations in the matrix force 
method take the form [6] 


n F Z + n x X = P (39) 

where x is a vector of ’redundant' or independent forces (and reactions), 

F is a vector of dependent forces, and P is the vector of externally applied 
forces. The redundants X are obtained from compatibility conditions of the 
f orm 

X = C(b) p + d(b) (Uo) 

Consider now a function f (b, F, X). Note that matrices C and d also depend 
on the design vector b. Form the~f unction H as 


H = f (b, F, X) + X 1 (P - n F - n X) 

^ ~ *'• "”r — "“X ** 

+ y T (X - C P - d) 

Consider the variation in H due to a variation in b: 

6H = 3f/3b 6b + (3f/3F - X T nj 6F 

— — p — 

+ (3f/3X - X T n + p T ) 6X - p T (6C*P + 6d) 


Choosing \ to satisfy 


T T 

nt, X = 3f/3F 
r - ■" 


and letting 

y T = A T n x - 3f/3X 

we have 

6f -= 3f/3b 6b - y T (6C*P + 6d) 
from which the sensitivity vector is obtained as 


( 1 + 1 ) 


(42) 

(43) 

(44) 

(45) 


df/db = 3f/3b - y T 3/3b (C P + d) (1+6) 

The physical significance of the adjoint variables X and y is obtained in the 
usual manner by considering the variation of H in Eq. (4l), due to a variation 
6P is P. Upon defining X as in Eq. (43) and j+ as in Eq. (44), we get 
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5f = (X T - y T C) 6P (1+7) 

Thus, (X - C T y) is now the influence coefficient vector. 

For statically determinate structures, the terms n x , X and y vanish, and 
Eq. (1+7) becomes 


6f = X T 6P (1+8) 

which is analogous to Eq. (33) that was obtained in the displacement method of 
analysis. In this case, X^ = value of f due to Pj = 1 , provided f is linear and 
homogeneous in P. 

5. ELASTICITY 

This section will focus on elasticity problems. Consider a functional 


= / f (o. . ) dfl 

Q J 1J 


(1+9) 


where ojj is the stress tensor, Q is the domain of the elastic body, and the 
equilibrium equations in variational form are 


(o.. (u) e.. (<}>) dQ = U. T. dr + <(>. B. 

) ij ij „ J i i J i i 


dQ 


(50) 


Equation (50) is satisfied for every displacement field $ satisfying ^ = 0 on 
I’ i , u is the actual displacement field due to traction forces Tj and body forces 
, and kinematic boundary conditions u = 0 is imposed on a portion F-| of the 
total boundary. Equation (50) is simply the principle of virtual work, with 
(c|>) being the virtual strain due to a kinematically admissible virtual 
displacement tj>. As before, form the functional H as 


[f(o. .) - o..(u) e. . (X)] dQ + J X. T. dr + JX. B. 
q J 1J 1J 1J r 1 1 n 1 1 


dQ ( 51 ) 


where X satisfies X = 0 on Ti . Consider a variation 6T^ in T^ , and 6B^ in B^ , 
and let ajj (v) and fjj(v) be the corresponding variations in stress and strain, 
Lvely. The variat 


respecti\ 


variation in H is given by 


6H = 


1 [3f/3a. . 0 . . (v) - 0 .. (v) £, . (X)] dQ 
S ij ij 


ij 


ij 


i h ST i dr * Jh 


SB. dQ 
1 


(52) 
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Equation (52) holds true for all kinematically admissible X, anG consequently 
holds true for a X determined from the following adjoint equations: 


jo (X) e (!P) da = /af/5o. . o.. OP) da 


ij 


'ij 


6F 


dft « J 
n 

3f/3o. . 
ij 

on r 1# 

Since 

da = J 

a 

yields 

r „ij (*> 

6T . + 
1 

J»i 6B 


ij 


(53) 


(5M 


Equation (5k) is essentially a variational principle. If we let the functional 
F be the complementary strain energy density, that is, we let 


/ ., 0. . 

[/ 1J e. . do. .] da 

a 0 1J 1J 

then Eq. ( 5 *+) yields (upon using Leibnitz's rule) 


) e. . 6a. . da = 
i 1J 1J r 


f X. 6T . dr + / X. 5B. 
J 1 1 „ 1 1 

a 


da 


(55) 


(56) 


which is the principle of complementary virtual work [ 7 ]. Finally, sensitivity 
expressions can be readily obtained if variation of H due to design variations 
is considered as done in previous sections. This approach holds true for 
a changing domain, as in shape optimal design [1]. From Eq. (5I+), we can see 
that Xj at a point represents the value of F due to a unit load at that point. 
This fact can be written in terms of Green's function as 


X = J f (0 (G )) da (57) 

a J 

where the Green's function G is the displacement field due to a unit load. 
6. FUTURE WORK 

In both optimal control and optimal design, it is shown that the 
Hamiltonian function and the Lagrange multiplier rule play a similar role. 
Optimal control theory helps to obtain a unified framework for design 
sensitivity analysis and physical understanding of adjoint variables. Some 
areas which may merit future investigation are noted below. 
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1. This work motivates an efficient gradient approach for optimal design 
of systems with control in mind. That is, both control variables and 
geometric design parameters can be considered simultaneously in the 
preliminary design stages. 

2. In structures, the adjoint method provides a powerful method for 

constructing influence lines in the framework of finite element 
analysis. Also, the equation X T = df/dP can be used to design 
structures which are insensitive to loads by minimizing x 2 i, or can 

be used to optimally locate the loads for maximum utilization of the 
structure, by maximizing X T P subject to suitable constraints. 

3. The stability analysis of the adjoint equations that has been carried 
out extensively in optimal control theory may turn out to be of 
importance to the design engineer. 
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